Viewpoint-based Text Categorization and Summarization

نویسنده

  • Atsushi Fujii
چکیده

科学技術や文化の急速な発展によって,言葉や事柄についてWorld Wide Web上のツールを用い て調べる機会が増えている.検索エンジンは情報の量が多いものの,情報が統制されておらず質が 低い.人手で編集する事典は情報の質が高いものの,情報の量が制限される.両者の長所を統合 するために,筆者らは,Web情報や特許情報から説明テキストを抽出し,体系化する研究を行っ ている.本研究は,ある見出し語について説明した複数のテキストを観点に基づいて分類するこ とで,多面的な要約を生成する手法を提案する.動物名や病名といった見出し語の種類によって 説明に必要な観点が異なるため,人手による手法では大規模化が困難である.そこで,Wikipedia から見出し語の種類ごとに観点の構造に関するテンプレートを抽出する.さらに,Wikipediaの 記事を機械学習のデータとして利用して,与えられた説明テキストを適切な観点に分類する.評 価実験によって本手法の有効性を示す.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Text Summarization Using Cuckoo Search Optimization Algorithm

Today, with rapid growth of the World Wide Web and creation of Internet sites and online text resources, text summarization issue is highly attended by various researchers. Extractive-based text summarization is an important summarization method which is included of selecting the top representative sentences from the input document. When, we are facing into large data volume documents, the extr...

متن کامل

Test Model for Text Categorization and Text Summarization

Abstract—Text Categorization is the task of automatically sorting a set of documents into categories from a predefined set and Text Summarization is a brief and accurate representation of input text such that the output covers the most important concepts of the source in a condensed manner. Document Summarization is an emerging technique for understanding the main purpose of any kind of documen...

متن کامل

EXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS

Due to the explosive growth of the world-wide web, automatictext summarization has become an essential tool for web users. In this paperwe present a novel approach for creating text summaries. Using fuzzy logicand word-net, our model extracts the most relevant sentences from an originaldocument. The approach utilizes fuzzy measures and inference on theextracted textual information from the docu...

متن کامل

A Text Categorization Based On A Summarization Extraction

We propose a new approach to text categorization based upon the ideas of summarization. It combines word-based frequency and position method to get categorization knowledge from the title field only. Experimental results indicate that summarization-based categorization can achieve acceptable performance on Reuters news corpus.

متن کامل

Systematic literature review of fuzzy logic based text summarization

Information Overloadrq  is not a new term but with the massive development in technology which enables anytime, anywhere, easy and unlimited access; participation & publishing of information has consequently escalated its impact. Assisting userslq    informational searches with reduced reading surfing time by extracting and evaluating accurate, authentic & relevant information are the primary c...

متن کامل

A survey on Automatic Text Summarization

Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008